41 research outputs found

    Accelerated focused crawling through online relevance feedback

    Get PDF
    The organization of HTML into a tag tree structure, which is rendered by browsers as roughly rectangular regions with embedded text and HREF links, greatly helps surfers locate and click on links that best satisfy their information need. Can an automatic program emulate this human behavior and thereby learn to predict the relevance of an unseen HREF target page w.r.t. an information need, based on information limited to the HREF source page? Such a capability would be of great interest in focused crawling and resource discovery, because it can fine-tune the priority of unvisited URLs in the crawl frontier, and reduce the number of irrelevant pages which are fetched and discarded

    The structure of broad topics on the web

    Get PDF

    Real-time augmented reality filters expressive of user sentiment

    Get PDF
    Body language and facial expressions are an important component of human communication. Some messaging applications include features to send emoji, animated GIFs, etc. to express emotion. However, such content does not include the user’s image. This disclosure describes techniques that enable users to choose augmented reality effects that are added to a user’s image and that help users express an emotion

    Clump: A scalable and robust framework for structure discovery

    No full text
    We introduce a robust and efficient framework called CLUMP (CLustering Using Multiple Prototypes) for unsupervised discovery of structure in data. CLUMP relies on finding multiple prototypes that summarize the data. Clustering the prototypes enables our algorithm to scale up to extremely large and high-dimensional domains such as text data. Other desirable properties include robustness to noise and parameter choices. In this paper, we describe the approach in detail, characterize its performance on a variety of datasets, and compare it to some existing model selection approaches.
    corecore